Figure 1: BLAST result against NCBI non-redundant database using SEQ ID NO: 6 (the
INSP108 polypeptide).



BLASTP 2.2.2 [Jan-08-2002]



Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

Query= INSP108.pp 

(77 letters) 

Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

1,267,376 sequences; 405,046,914 total letters 

Searching done 

Sequences producing significant alignments: 

ref|NP_697019.1| defensin, beta 123; defensin, beta 23 [Homo sap...
ref|XP_141520.1| similar to defensin, beta 123; defensin, beta 2...
gb|AAM93917.1| defensin beta 124 [Homo sapiens]
emb|CAB72350.2| dJ1018D12.3 (a putative novel protein) [Homo sap...
ref|NP_473453.1| epididymus specific clone 42; chromosome 20 ope...
ref|NP_660139.1| defensin beta 119; testis-specific beta-defensi...
sp|Q95LlO|D118_MACMU Beta-defensin 118 precursor (Epididymal sec...
ref|NP_631968.1| defensin beta 15 [Mus musculus] >gi|19171622|em...
ref|NP_689464.1| defensin, beta 106; defensin, beta 6 [Homo sapi...
gb|AAN33114.1| beta-defensin 106 [Homo sapiens]

Figure 2: Alignment between INSP108 polypeptide sequence (SEQ ID NO:6) and
defensin beta 123 (Homo sapiens).



>ref|NP_697019.1| defensin, beta 123; defensin, beta 23 [Homo sapiens]
sp|Q8N688|D123_HUMAN Beta-defensin 123 precursor (Beta-defensin 23) (DEFB-23)
gb|AAM93916.1| defensin beta 123 [Homo sapiens]
Length = 67 

Score = 50.8 bits (120), Expect = 4e-06

Identities = 26/67 (38%), Positives = 33/67 (48%), Gaps = 10/67 (14%)

Query: 6  VLLALVVLNLLFYVPPGRSGPNVYIQKIFASCWRLQGTCRPKCLKNEQYRILCDTIHLCC 65
          +LL L VL LL  + PG +            CW L G CR +C K E+  + C    +CC
Sbjct: 4  LLLTLTVLLLLSQLTPGGT----------QRCWNLYGKCRYRCSKKERVYVYCINNKMCC 53

Query: 66 VNPKYLP 72
          V PKY P
Sbjct: 54 VKPKYQP 60
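
The identity and gap counts in the alignment summary can be recounted directly from the two aligned rows. The following is a minimal Python sketch, not part of the original figure. It assumes that the query row reads as in the Figure 7 translation and that the ten gap columns in the subject row are written as '-'. All variable and function names are illustrative.

```python
# Aligned rows transcribed from Figure 2 (both blocks concatenated).
# Assumption: gap columns in the subject row are written as '-'.
query = (
    "VLLALVVLNLLFYVPPGRSGPNVYIQKIFASCWRLQGTCRPKCLKNEQYRILCDTIHLCC"
    "VNPKYLP"
)
sbjct = (
    "LLLTLTVLLLLSQLTPGGT----------QRCWNLYGKCRYRCSKKERVYVYCINNKMCC"
    "VKPKYQP"
)


def alignment_counts(row_a, row_b):
    """Return (alignment length, identical columns, gap columns)."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    identities = sum(
        1 for a, b in zip(row_a, row_b) if a == b and a != "-"
    )
    gaps = sum(1 for a, b in zip(row_a, row_b) if a == "-" or b == "-")
    return len(row_a), identities, gaps


length, identities, gaps = alignment_counts(query, sbjct)
print(f"Identities = {identities}/{length}, Gaps = {gaps}/{length}")
```

The "Positives" count is not recomputed here because it depends on the substitution matrix used by BLAST, which this sketch does not include.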
Figure 3: Sig P cleavage site prediction for INSP108.

>INSP108.pp

Prediction: Signal peptide 
Signal peptide probability: 1.000 
Signal anchor probability: 0.000 

Max cleavage site probability: 0.906 between pos. 24 and 25 
Figure 4: BLAST result against NCBI non-redundant database using SEQ ID NO: 14 (the
INSP109 polypeptide).



BLASTP 2.2.2 [Jan-08-2002]



Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

Query= INSP109.pp

(78 letters) 

Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

1,267,376 sequences; 405,046,914 total letters 

Searching done 



Sequences producing significant alignments: 
dbj|BAC37510.1| unnamed protein product [Mus musculus]
ref|NP_062702.1| defensin beta 4; beta defensin-4 [Mus musculus] ...
emb|CAA08905.1| beta defensin-2 [Capra hircus]
ref|NP_348802.1| Uncharacterized protein, homolog HI1244 from Ha...
gb|AAG10514.1|AF288371_1 beta-defensin 4 variant [Mus musculus]
emb|CAD23115.1| blue cone opsin [Cottus kesslerii]
ref|NP_689464.1| defensin, beta 106; defensin, beta 6 [Homo sapi...
gb|AAN33114.1| beta-defensin 106 [Homo sapiens]
ref|XP_163302.1| hypothetical protein XP_163302 [Mus musculus]

Figure 5: Alignment between INSP109 polypeptide sequence (SEQ ID NO:14) and
defensin beta 4 (Mus musculus).

>ref|NP_062702.1| defensin beta 4; beta defensin-4 [Mus musculus]
sp|P82019|BD04_MOUSE Beta-defensin 4 precursor (BD-4) (mBD-4)
gb|AAD38852.1|AF155882_1 beta defensin-4 [Mus musculus]
[accession illegible] beta-defensin 4 precursor [Mus musculus]
dbj|BAB26051.1| unnamed protein product [Mus musculus]
dbj|BAB26207.1| unnamed protein product [Mus musculus]
Length = 63 

Score = 32.7 bits (73), Expect = 1.1

Identities = 19/56 (33%), Positives = 27/56 (47%), Gaps = 1/56 (1%)

Query: 8  LLFFLVILLPSGKGMFGNDGVKVRTCTSQKAVCFFGCPPGYRWIAFC-HNILSCCK 62
          LLF  +++L S    F        TC +  A+C+  CP  +R I  C H  + CCK
Sbjct: 6  LLFTFLLVLLSPLAAFTQIINNPITCMTNGAICWGPCPTAFRQIGNCGHFKVRCCK 61



WO 2004/063219 PCT/GB2004/000101



Figure 6: Sig P cleavage site prediction for INSP109. 



>INSP109.pp

SignalP-NN prediction (euk networks): INSP109.pp

[Plot: C score, S score and Y score against sequence position (1-70)]

>INSP109.pp  length = 70
# Measure  Position  Value  Cutoff  signal peptide?
  max. C   22        0.849  0.33    YES
  max. Y   22        0.854  0.32    YES
  max. S   7         0.994  0.82    YES
  mean S   1-21      0.953  0.47    YES
# Most likely cleavage site between pos. 21 and 22: GKG-MF
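
Each "YES" in the SignalP-NN summary corresponds to a score that exceeds its cutoff. The short Python sketch below, which is not part of the original figure, restates that comparison using the values transcribed from the table. The decision rule (score strictly greater than cutoff) is an assumption about how the program reports its calls, and the names used are illustrative.

```python
# Scores and cutoffs transcribed from the SignalP-NN summary for INSP109.
nn_scores = {
    "max. C": (0.849, 0.33),
    "max. Y": (0.854, 0.32),
    "max. S": (0.994, 0.82),
    "mean S": (0.953, 0.47),
}


def call_signal_peptide(value, cutoff):
    # Assumed rule: report "YES" when the score exceeds the cutoff.
    return "YES" if value > cutoff else "NO"


calls = {
    measure: call_signal_peptide(value, cutoff)
    for measure, (value, cutoff) in nn_scores.items()
}
for measure, call in calls.items():
    print(f"{measure:7s} {call}")
```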



SignalP-HMM prediction (euk models): INSP109.pp

[Plot: cleavage probability and n-region, h-region and c-region probabilities against sequence position (1-70)]

>INSP109.pp

Prediction: Signal peptide
Signal peptide probability: 0.999
Signal anchor probability: 0.001

Max cleavage site probability: 0.619 between pos. 21 and 22




Figure 7: Predicted nucleotide sequence of INSP108 with translation 



1 ttgttccaaa aggttcacta gccatgcagc tccccgtctc ttcaaagctg cggagagagt 

61 gactctccga tgagtcacag ctgcttcttt gctgattggt atggccacaa ggagcgtcct 

mat r s v 

=>

INSP108-F1

121 cttggccctc gtggtcctta acttactctt ctatgttcca ccaggtagaa gtggacccaa 
llal vvl nll fyvp pgr s g p

181 tgtctacata caaaaaatct ttgcttcatg ttggcgactg caaggtactt gccggccaaa 
nvyi q ki fas cwrl qgt crp 

241 atgtctaaaa aacgaacaat atcgtatttt gtgtgatact atacatttgt gctgtgtaaa 
k c l k neq yri lcdt ihl ccv

301 cccaaaatat ttacctatac tgactgggaa atagttgtga gtacctgaaa gctgttgctg 
npkylpiltgk 

INSP108-R1

361 atttcctctg ggaacccaga tccctctcag ttgcaccatt cgattaaaac aatggcttta 
421 gcctatcagt gttc 



Position and sense of PCR primers 
Underlined : signal sequence 
Figure 8: Nucleotide sequence with translation of INSP108 PCR product cloned using
primers INSP108-CP1 and INSP108-CP2.



1 attggtatgg ccacaaggag cgtcctcttg gccctcgtgg tccttaactt actcttctat 

m atr svll alv vln llfy 

INSP108-F1

61 gttccaccag gtagaagtgg acccaatgtc tacatacaaa aaatctttgc ttcatgttgg 

vpp grs gpnv yiq kif ascw 

121 cgactgcaag gtacttgccg gccaaaatgt ctaaaaaacg aacaatatcg tattttgtgt 

rlq gtc rpkc lkn eqy rilc

181 gatactatac atttgtgctg tgtaaaccca aaatatttac ctatactgac tgggaaatag 

d t i hlc cvnp kyl pil tgk 

< 

241 ttgtg INSP108-R1



Position and sense of PCR primers 
Figure 9: Map of pCR4-TOPO-INSP108

Figure 11: Map of expression vector pEAK12d

Molecule: pEAK12d, 8760 bps DNA Circular
File Name: pEAK12DEST.cm5
Description: Mammalian cell expression vector (plasmid ID 11345)

Figure 12: Map of expression vector pDEST12.2

Figure 13: Map of pDONR221-INSP108-6HIS

Figure 14: Map of pEAK12d-INSP108-6HIS

Figure 15: Map of pDEST12.2-INSP108-6HIS
Figure 16: Predicted nucleotide sequence of INSP109 with translation



1 atgaacctct gtctttctgc attactcttc ttcctggtga tcttactgcc ttcaggaaaa
m n l c l s a l l f f l v i l l p s g k

61 ggtatgtttg ggaatgatgg agtcaaagtt cgcacctgca ctagccagaa agccgtatgt
g m f g n d g v k v r t c t s q k a v c

121 ttcttcgggt gtccgccagg atacaggtgg attgcgttct gccacaatat tctgtcttgc
f f g c p p g y r w i a f c h n i l s c

181 tgtaaaaata tgacacgttt tcaacccccg caagccaaag atccatgggt tcat
c k n m t r f q p p q a k d p w v h



Underlined= signal peptide 
Figure 17: INSP109 coding exon organization in genomic DNA and position of PCR
primers

Figure 18: Nucleotide sequence and translation of cloned INSP109 ORF

1 atgaacctct gtctttctgc attactcttc ttcctggtga tcttactgcc ttcaggaaaa 
mnl cls allf flv ill psgk

61 ggtatgtttg ggaatgatgg agtcaaagtt cgcacctgca ctagccagaa agccgtatgt
gmf gnd gvkv rtc tsq kavc

121 ttcttcgggt gtccgccagg atacaggtgg attgcgttct gccacaatat tctgtcttgc
ffg cpp gyrw iaf chn ilsc

181 tgtaaaaata tgacacgttt tcaacccccg caagccaaag atccatgggt tcatta
ckn mtr fqpp qak dpw vh
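
The translation shown under the cloned INSP109 ORF can be checked by translating the nucleotide sequence with the standard genetic code. The following Python sketch is not part of the original figure. It assumes the nucleotide sequence as transcribed here (with scanning errors corrected against the translation), translates only the complete codons before the trailing partial stop codon, and splits the product at the cleavage site predicted between positions 21 and 22 in Figure 6. All names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Cloned INSP109 ORF as transcribed from Figure 18 (assumed transcription;
# the trailing partial stop codon is left out).
orf = (
    "atgaacctct gtctttctgc attactcttc ttcctggtga tcttactgcc ttcaggaaaa "
    "ggtatgtttg ggaatgatgg agtcaaagtt cgcacctgca ctagccagaa agccgtatgt "
    "ttcttcgggt gtccgccagg atacaggtgg attgcgttct gccacaatat tctgtcttgc "
    "tgtaaaaata tgacacgttt tcaacccccg caagccaaag atccatgggt tcat"
).replace(" ", "").upper()


def translate(dna):
    """Translate complete codons of a DNA string in frame 1."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


protein = translate(orf)

# Cleavage site predicted between positions 21 and 22 (Figure 6).
signal_peptide, mature = protein[:21], protein[21:]

print(protein)
print("signal peptide:", signal_peptide)
print("mature peptide:", mature)
```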
Figure 19: Map of pCR4-TOPO-INSP109

Figure 20: Map of pDONR-INSP109-6HIS

Figure 21: Map of pEAK12d-INSP109-6HIS

Figure 22: Map of pDEST12.2-INSP109-6HIS